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Abstract 

Time-evolving interaction patterns studied in different contexts can be well represented 
by so called temporal networks, i.e. networks in which nodes are intermittently connected. 
In this manuscript we introduce the notion of betweenness preference in temporal networks. 
It captures how likely a certain node is to mediate interactions between particular pairs of 
its neighboring nodes. We argue that betweenness preference is an important correlation 
to consider in the analysis of temporal network data. In particular, it allows to assess to 
what extent paths, existing in time-aggregated, static representations of temporal networks, 
are actually realizable based on the underlying sequence of interactions. We argue that 
betweenness preference correlations are present in empirical data sets. We further show that 
neglecting betweenness preference will lead to significantly wrong statements about dynamic 
processes on temporal networks. 

Many complex systems exhibit dynamically changing interaction topologies. In order to simplify 
their analysis such systems are typically studied using static, time-aggregated networks. Recent 
works have argued that properties of dynamic processes evolving on complex networks change 
significantly when the dynamics of the network topology is taken into account |T]. It has been 
observed that e.g. spreading processes evolve slower on such temporal networks [2j El S] • As 
one explanation for this slowdown, the presence of bursty node activity patterns has been sug- 
gested |5j 12 |7]. Focusing on structural properties of temporal networks, rather than on patterns 
of node activity, in this manuscript we study to what extent nodes preferentially mediate infor- 
mation flows between particular pairs of neighboring nodes. We propose a measure, betweenness 
preference, to study this preferential mediation. Our study is motivated by the idea that in many 
networks nodes contact particular other nodes based on the previous contact. One example is 
the influence of context in information dissemination: work-related emails are more likely to be 
forwarded to a work-related subset of social contacts. Another example are spatial constraints 
in disease spreading: an infected individual will not infect all of her (aggregated) social contacts 
at once. Rather there will be a sequence of contacts, based on the individual's mobility pattern. 
Here we study the influence of such special classes of dynamical contact patterns arising in tem- 
poral networks. We argue that betweenness preference a) is not captured in the time-aggregated 
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network, b) is present in empirical temporal network data and c) critically influences dynamical 
processes evolving on temporal networks. 

The concept of temporal networks intends to overcome the limitations of time-aggregated network 
representations. A temporal network is defined as a tuple consisting of a set of nodes v £ V as 
well as a set of events: e(v, w,t,l • At) £ E. An event is an interaction between two nodes v and 
w, starting at time t and with a duration I ■ At. Here the duration is relative to some smallest 
unit of discrete time At. Based on the time-stamped edges and a discrete notion of time one can 
construct a flow-preserving static representation of temporal networks by unfolding time into an 
additional topological dimension. This construction serves as the basis for our models. We call 
it a time-unfolded network. The time-unfolded networks of two different temporal networks are 
depicted in the top part of Fig. [T] In the resulting temporal unfolding, we indicate the presence 
of a possible flow event by an edge (vt, itftfAt); while replacing the original node set V by a 
set V' of temporal copies of nodes vt where v 6 V and t £ {0, At, . . . , LAt} for an observation 
period of length L ■ At. Throughout the paper, we assume At = 1 for simplicity. We would 
like to highlight that similar constructions have been used in the study of temporal networks 
before [8j[9j[l0]. As can be seen in Fig. [T] (top), the two different temporal networks are the same 
in the time-aggregated representation GAgg (bottom) . In GAgg edge weights indicate the number 
I of discrete time steps in which a particular edge has been active throughout the observation 
period. In analogy to Statistical Mechanics, one might think of such a time-aggregated network 
as a macro-state which is compatible with different micro-states, i.e. temporal networks. 

Betweenness Preference An important aspect when studying dynamical processes like 
spreading or synchronization on static, time- aggregated networks is that one assumes transi- 
tive paths. However, this transitivity does not necessarily hold in a (empirical) temporal network 
that gives rise to the respective time- aggregated network. To illustrate this fact, consider the 
time-aggregated network GAgg depicted in Fig. [T] If it would represent a static system, infor- 
mation could spread in a transitive way from b via e to g. However, in a temporal network the 
order in which edges appear imposes an additional constraint: information can only flow along 
time-respecting paths [2J. Hence, in a temporal network underlying GAgg, information can only 
flow from b to g if there is a time-respecting path, i.e. the connection (b, e) is followed by a con- 
nection (e,g). Accordingly, even though the links (b,e) and (e, g) are present in both temporal 
networks (Fig. [I] top), a time-respecting path between b and g only exists in the left example. In 
order to capture this transitivity-limiting property of temporal networks in a quantitative way, 
we study whether certain time-respecting paths are preferentially realized as compared to the 
time-aggregated perspective. Focusing on the most elementary building block of a time-respecting 
path, we particularly study what we call a two-path, i.e. a path of length two, representing two 
consecutive edge activations that interconnect three nodes. The statistics of this basic element 
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Figure 1: Time-aggregated weighted network Gaqq (bottom) and time-unfolded network of two 
different temporal networks Goym and GDyn 2 (top), both of which are consistent with Gaqq- 

will reveal whether, and to what extent, path-transitivity holds in the temporal network. Based 
on the introduced time-unfolded representation of temporal networks, we define the elements of 
a per-node betweenness preference matrix B v (i) as follows [^J 

1, if (s t -i,v t ) € E and (v t ,d t +i) G E 
0, otherwise 



B v sd (t) := 



1) 



Each matrix element BVJt) captures whether node v% in a time-unfolded temporal network was 
in between a source st-i and a destination dt+\ on a two-path (st-i,Vt) — (vf,dt+{)- Based on 



1 This definition builds on a notion of time-respecting paths that are comprised of edge activations following 
each other immediately. However, this restriction can be relaxed easily by introducing self-edges (v t ,vt+i) for all 
nodes v and times t. 
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this, we define the elements of a time- aggregated betweenness preference matrix B u : 

-l 



B sd '■= A ffirift) 



(2) 



and a normalized betweenness preference matrix ~P V : 



pv r>D 



s'd' 



(3) 



Essentially, P". is the probability distribution of the two-paths (st-i,Vt) — (yt,dt+i) over all 
t. We use this to quantify to what extent v exhibits a preference to interconnect particular 
pairs of source and target nodes. Based on the concept of mutual information, we define a new 
temporal-network measure betweenness preference as 



(4) 



deD 
ses 



where P v (s) = Yld^sd an< ^ P v (d) = Yls^sd- ^ n g enerai ) I V (S;D) captures to what extent 
the knowledge of the source s of a temporal path through v determines the next step d. Or, 
within a context of information flow, it measures how selective v is in mediating information 
preferentially between certain pairs of nodes s and d. We note that betweenness preference, as 
a mutual information measure, is minimal if the random variables S and D are independent. 
This allows us to calculate the matrix elements P^ d resulting in I V (S; D) = solely based on the 
underlying static, time-aggregated network with edge weights ivy. 



P v (s,d) :=p v (s)-p v (d). 



(5) 



where p v (s) = w sv [J2 S w sv ] and p v (d) = w v d[Y^d w vd] ■ For the time-aggregated network 
given in Fig. [T] the joint probability for S and D that results in I e (S; D) = is given as 
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We will now introduce a simple configuration model to generate temporal networks that are 
members of a temporal network ensemble defined by a given normalized betweenness preference 
matrix. Here we will limit ourselves to a subset of possible realizations, in which there is always 
one edge active per time step and there are only paths of length two realized (as an example 
consider top left in Fig. [2]). The model creates a temporal network from a given P v as follows: 
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Figure 2: Time- unfolded networks of two temporal networks. Both are consistent with G Agg 
(Fig. [TJ. Left: Unfolding of randomly generated microstate; I e (S;D) = 0.219. Right: Unfolding 
of microstate with high betweenness preference; I e (S;D) = 1.295. 



First, we define the number of two-paths ./V2 to be realized (which is equivalent to defining the 
simulation time T). Second, we draw a random two-path (s, v) — (v, d) according to p(s, v, d) = 
Psdl 'Ylv Psd- Third, we create temporal network edges s(t) — > v(t + 1) and v(t + 1) — > d(t + 2). 
We increment t = t+ 1 and n<i = 71-2 + 1 (the number of realized two-paths), go to the second step 
and iterate until ri2 = N2. Having empirical temporal network data available, we use this model 
to create microstates that (i) preserve betweenness preference as well as the macrostate, (ii) 
destroy other correlations (such as bursty node activations). We call the so constructed temporal 
network the betweenness preference preserving case. Using the configuration model described 
above, it is also straight forward to construct microstates with low betweenness preference based 
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Figure 3: Betweenness preference distribution for (A) a temporal network of the uncorrelated 
case, (B) the empirical RealityMining data and (C) a P v preserving case. All temporal networks 
have the same length of N2 = 50000 two-paths. (A) and (C) are based on the same empirical 
data as (B). 



on a given macrostate, by using the probability P v (s,d) defined in eq. (J5J). We call a temporal 
network created in such a way the uncorrelated case, since it only preserves the macrostate, but 
destroys betweenness preference. To be precise, the uncorrelated case has betweenness preference 
that is expected from a random microstate of finite length. (In a temporal network of infinite 
length the model would reproduce the limiting case of I V (S; D) = 0.) 



Empirical Results As evidence that betweenness preference is an important property in real- 
world data sets, we show results based on empirical temporal contact networks of the Reali- 
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tyMining Project Here we use a one-week subset of the contact network (Sept. 8th to 15th 
2004), featuring 64 individuals with 20000 recorded interactions. For details and freely available 
data see |llj . In Fig. [3] we present the distribution of betweenness preference present in (A) an 
uncorrelated case, (B) the original data sample and (C) a betweenness preference preserving case. 
All temporal networks have equal length of N2 = 50000 two-paths. The uncorrelated (A) and 
correlated (C) case have been created using the empirical data using the configuration model de- 
scribed above. As expected, in the uncorrelated case (A) there is a clear spike around I(S; T) = 0, 
indicating that most nodes show only small betweenness preference. The theoretical expectation 
of I(S; T) = for all nodes in the uncorrelated case is not realized due to finite length of the 
temporal sequence. Observing betweenness preference in the empirical temporal network (B), one 
realizes that the distribution is very different from the one in (A). Especially, the distribution 
in (B) is rather broad, with an average (I(S;T)) = 1.9 and a median Qo^(I(S;T)) = 1.99, as 
compared to (I(S;T)) = 0.9 and Qo.5(I(S;T)) = 0.5 in (A). This leads to the conclusion that 
there is a significant amount of betweenness preference in the empirical contact sequence, that 
would be missed by assuming the uncorrelated case (A). Hence indicating that there clearly is less 
path transitivity in empirical temporal network data than would be assumed by the uncorrelated 
model. 

In the third panel (C) we show the betweenness preference distribution of the betweenness pref- 
erence preserving case. With (I(S;T)) = 2.04 and median Qq^(I(S;T)) = 2.10, distributions 
(B) and (C) are very similar. Since we create the model in a statistical fashion based on the 
normalized betweenness preference matrix ~P V , the two distributions are not completely identi- 
cal due to finite N2. Performing the two-sided Kolmogorov-Smirnov test, we cannot reject the 
hypothesis that the two distributions are identical with p = 0.66. Hence, in this case the model 
preserves the actual betweenness preference of the real network, whereas all other correlations 
(e.g. bursty node activities) are destroyed. 

Spreading Dynamics in Temporal Networks Turning our attention to the influence of 
betweenness preference on dynamical processes evolving on temporal networks, we study the 
dynamics of the paradigmatic SI (Susceptible-Infected) epidemic model. In order to exclusively 
focus on the impact of betweenness preference correlations, we compare the spreading dynamics 
on temporal networks created with the uncorrelated model case (A) , and the betweenness prefer- 
ence preserving case (C). Additionally to the empirical social contact data, we also consider large 
synthetic networks that show similar modularity, cluster structure and density as the empirical 
data [12j, and whose nodes have high betweenness preference]^] The time series of the average 
relative difference between the number of infected individuals A = (N u (t) — N p (t)) / (N u (t)) in 



2 The RealityMining project recorded time-stamped social contacts baed on proximity sensing technology, with 

a time-resolution of 5 minutes and over a period of 10 months in 2004/2005. 

3 For reproducibility, these artificial temporal networks are available at http://www.sg.ethz.ch/people/prene 
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Figure 4: Si-spreading dynamics for temporal networks of empirical RealityMining data (black) 
and 1000 node synthetic network (gray). Main figure shows time evolution of average relative 
difference A (40 realizations) in SI spreading dynamics between the uncorrelated and the be- 
tweenness preference preserving cases. Insets (left: empirical RealityMining data, right: 1000 node 
synthetic network) show time evolution of number of infected nodes for the uncorrelated (N u ) 
and betweenness preference preserving case (N p ). 

the uncorrelated and the betweenness preference preserving based on the empirical data is shown 
as the black curve in the main panel of Fig. [4] The solid and dashed curves in the left inset show 
the evolution of the number of infected individuals for the uncorrelated and the betweenness 
preference preserving cases respectively. For both cases, equally long temporal networks with 
150000 time steps (i.e. 50000 two-paths) have been created and the first active individual was 
initially infected. The spreading probability was set to p = 1. The number of infected individuals 
clearly follows a typical S-shaped curve (however, notice the semi-logarithmic scaling) in the 
uncorrelated N u as well as the betweenness preference preserving N p case. The slopes in the 
middle part of the infection dynamics are however clearly different, indicating slower spreading 
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in the temporal network with non- vanishing betweenness preference N p . This slow-down is es- 
pecially indicated by the time-to-saturation which, at least in the cases based on the empirical 
data (Fig. |4j left inset), is roughly one order-of-magnitude larger for N p . To substantiate this 
important effect of large betweenness preference, in the main panel we show the time evolution of 
the relative differences of infected individuals A. Our results clearly show that the uncorrelated 
model significantly overestimates the average number of infected individuals - at times by up to 
35%. Results for the larger synthetic networks (Fig. |4j gray curve and top right inset) confirm 
these findings and show evidence that in larger systems betweenness preference can have an even 
more pronounced effects on spreading processes: our simulations indicate that in larger networks 
with high betweenness preference the slow-down of spreading processes can be as large as = 80%. 
Interpreting A as the error made when not accounting for betweenness preference, it becomes 
obvious that taking a time- aggregated perspective of temporal networks, and hence neglecting 
betweenness preference, can lead to tremendously wrong statements about dynamical processes 
evolving on networks with dynamic topology. Betweenness preference, as a precise and easy-to- 
calculate measure, helps to quantify this possible pitfall and to decide whether a time-aggregated 
network perspective is sufficient. If the nodes of an empirical temporal network show no (or only 
very small) betweenness preference, realized time-respecting paths are statistically distributed 
as would be expected from a time- aggregated prespective. 
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